Look-ahead techniques for fast beam search

نویسندگان

  • Stefan Ortmanns
  • Andreas Eiden
  • Hermann Ney
  • Norbert Coenen
چکیده

In this paper, we present two efficient look-ahead pruning techniques in beam search for large vocabulary continuous speech recognition. Both techniques, the language model look-ahead and the phoneme look-ahead, are incorporated into the word conditioned search algorithm using a bigram language model and a lexical prefix tree [5]. The paper present the following novel contributions: We describe a method for language model (LM) look-ahead pruning which is similar to [1, 9]. We show special techniques to reduce the memory and computational requirements. These techniques are based on a compressed LM look-ahead tree. To compute the LM look-ahead tree probabilites in an efficient way, we present a backward dynamic programming scheme. We present a phoneme look-ahead pruning technique to increase the efficiency of the acoustic pruning. In particular, we refine the acoustic pruning strategy by a 1and 2-phoneme look-ahead, respectively. We report results for both look-ahead pruning methods on the the 20,000-word North American Business (NAB’94) task. As a result, the combination of bigram look-ahead and 1phoneme look-ahead reduces the search space by a factor of 10 without loss in recognition accuracy in comparison with the baseline search using a unigram language model look-ahead as described in [2]. The computational costs can be reduced by a factor of 5 on a SGI workstation (Indy R4400).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Look-ahead Techniques for Improved Beam Search

This paper presents two look-ahead techniques for large vocabulary continuous speech recognition. These two techniques, which are referred to as language model look-ahead and phoneme look-ahead, are incorporated into the pruning process of the time-synchronous one-pass beam search algorithm. The search algorithm is based on a tree-organized pronunciation lexicon in connection with a bigram lang...

متن کامل

Reducing time-synchronous beam search effort using stage based look-ahead and language model rank based pruning

In this paper, we present an efficient look-ahead technique based on both the Language Model (LM) Look-Ahead and the Acoustic Model (AM) Look-Ahead, for the time-synchronous beam search in the large vocabulary speech recognition. In this so-call stage based look-ahead (SLA) technique, two predicting processes with different hypothesis evaluating criteria are organized by stages according to the...

متن کامل

An augmented beam-search based algorithm for the strip packing problem

In this paper, the use of beam search and look-ahead strategies for solving the strip packing problem (SPP) is investigated. Given a strip of fixed width W, unlimited length L, and a set of n circular pieces of known radii, the objective is to determine the minimum length of the initial strip that packs all the pieces. An augmented algorithm which combines beam search and a look-ahead strategie...

متن کامل

Improved lexical tree search for large vocabulary speech recognition

This paper describes some extensions to the language model (LM) look-ahead pruning approach which is integrated into the time-synchronous beam search algorithm. The search algorithm is based on a lexical prefix tree in combination with a wordconditioned dynamic search space organization for handling trigram language models in a one-pass strategy. In particular, we study several LM look-ahead pr...

متن کامل

IDA*_MCSP: a fast exact MCSP algorithm

QoS routing has been shown to be NP-hard. A recent study of its hardness suggests that the “worst-case” may not occur in practice and thus there may exist a fast exact algorithm. In this paper, we deploy the idea of iterative deepening search and look ahead to design an exact algorithm for finding the shortest path subject to multiple constraints (the MCSP problem). The accuracy of look-ahead i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computer Speech & Language

دوره 14  شماره 

صفحات  -

تاریخ انتشار 1997